Research Paper: Evaluation of a Method to Identify and Categorize Section Headers in Clinical Documents

نویسندگان

  • Joshua C. Denny
  • Anderson Spickard
  • Kevin B. Johnson
  • Neeraja B. Peterson
  • Josh F. Peterson
  • Randolph A. Miller
چکیده

OBJECTIVE Clinical notes, typically written in natural language, often contain substructure that divides them into sections, such as "History of Present Illness" or "Family Medical History." The authors designed and evaluated an algorithm ("SecTag") to identify both labeled and unlabeled (implied) note section headers in "history and physical examination" documents ("H&P notes"). DESIGN The SecTag algorithm uses a combination of natural language processing techniques, word variant recognition with spelling correction, terminology-based rules, and naive Bayesian scoring methods to identify note section headers. Eleven physicians evaluated SecTag's performance on 319 randomly chosen H&P notes. MEASUREMENTS The primary outcomes were the algorithm's recall and precision in identifying all document sections and a predefined list of twenty-nine major sections. A secondary outcome was to evaluate the algorithm's ability to recognize the correct start and end boundaries of identified sections. RESULTS The SecTag algorithm identified 16,036 total sections and 7,858 major sections. Physician evaluators classified 15,329 as true positives and identified 160 sections omitted by SecTag. The recall and precision of the SecTag algorithm were 99.0 and 95.6% for all sections, 98.6 and 96.2% for major sections, and 96.6 and 86.8% for unlabeled sections. The algorithm determined the correct starting and ending text boundaries for 94.8% of labeled sections and 85.9% of unlabeled sections. CONCLUSIONS The SecTag algorithm accurately identified both labeled and unlabeled sections in history and physical documents. This type of algorithm may assist in natural language processing applications, such as clinical decision support systems or competency assessment for medical trainees.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Validation of Environmental Curriculum Framework Based on Upstream documents in Middle school

The purpose of this study was to design and validate the environmental curriculum framework based on the upstream documents in the middle school, which was carried out using a qualitative-quantitative method with exploratory design in two sections. The research population in the first section of the research was upstream documents. The sampling method from this population was criterion-based. T...

متن کامل

Presenting a Framework for Supporting Life-long Learning in Iranian public libraries and Its validation

Purpose: Since nowadays public libraries are considered lifelong learning centers, these centers must have the required standards and conditions to support lifelong learning in order that they could help society members to achieve their personal and professional learning more effectively. Accordingly, it is necessary to develop and provide a mechanism to support lifelong learning in public libr...

متن کامل

A Machine Learning Approach to Identifying Sections in Legal Briefs

With an abundance of legal documents now available in electronic format, legal scholars and practitioners are in need of systems able to search and quantify semantic details of these documents. A key challenge facing designers of such systems, however, is that the majority of these documents are natural language streams lacking formal structure or other explicit semantic information. In this re...

متن کامل

The Attitude of Academic Experts to the University Interaction with Society

Attention to the university's interaction with society, along with the two main functions of the higher education system, namely; Education and research implies the social role of universities in responding to the needs and expectations of society at different levels. Based on this, the current research aims to identify and categorize the attitude of university experts regarding the upcoming ch...

متن کامل

Designing a Combined-fuzzy Methodology to Improve Organizational Diagnosis Process Effectiveness through Identification and Assessment of Effective Parameters

Organizational diagnosis is a systematic and scientific method to identify, categorize and single out the obstacles and their impact on organizational performance through interaction between internal and external views and preparation and setting up operational plans to solve them in the organization. Providing standard products and emphasizing on the financial measures do not guarantee the sur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of the American Medical Informatics Association : JAMIA

دوره 16 6  شماره 

صفحات  -

تاریخ انتشار 2009